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Abstract 
^ Data from a network of gravitational wave detectors can be analyzed in coincidence to increase 

detection confidence and reduce non-stationarity of the background. We propose and explore a 
geometric algorithm to combine the data from a network of detectors. The algorithm makes optimal 

o 

use of the variances and covariances that exist amongst the different parameters of a signal in a 






coincident detection of events. The new algorithm essentially associates with each trigger ellipsoidal 

regions in parameter space defined by the covariance matrix. Triggers from different detectors are 

deemed to be in coincidence if their ellipsoids have a non-zero overlap. Compared to an algorithm 

that uses uncorrelated windows separately for each of the signal parameters, the new algorithm 
> 

greatly reduces the background rate thereby increasing detection efficiency at a given false alarm 

2£ rate. 
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I. MOTIVATION 

Long baseline interferometric gravitational wave detectors, such as the Laser 
Interferometer Gravitational- Wave Observatory (LIGO) [1], Virgo [2], and GEO 600 [3], 
are currently acquiring the best data ever. The data sets from the different detectors can 
be either brought together and analyzed phase coherently [4, 5, 6, 7], or analyzed separately 
followed up by a coincidence analysis [6, 8, 9, 10, 11, 12, 13, 14, 15] of the triggers obtained. 
Coherent analysis maximizes signal visibility (i.e., gives the best possible signal-to-noise 
ratio in the likelihood sense) while the goal of coincidence analysis is to reduce and mitigate 
the non-stationary and non-Gaussian background noise. A recent comparison of coherent 
analysis vis-a-vis coincidence analysis under the assumption that the background noise is 
Gaussian and stationary has concluded that coherent analysis, as one might expect, is far 
better than coincidence analysis [16]. However, there are two reasons why current data 
analysis pipelines prefer the latter over former. Firstly, since the detector noise is neither 
Gaussian nor stationary, coincidence analysis can potentially reduce the background rate 
far greater than one might think otherwise. Secondly, coherent analysis is computationally 
far more expensive than coincidence analysis and it is presently not practicable to employ 
coherent analysis. 

A. The problem of coincident detection 

In coincidence analysis (see for example, Refs. [11, 12, 13, 14, 17, 18]), data sets from 
each detector will be analyzed separately and the triggers from the end of the pipeline 
from different detectors compared with one another to identify triggers that might be in 
coincidence with one another. More precisely, the goal is to find if the parameters of a 
trigger (e.g., in the case of a coalescing binary the time of merger, the component masses 
and spins) from one detector are identical to those from another. Since the presence of noise 
causes errors in the measurement of parameters of an inherent signal, it is highly improbable 
that the same gravitational wave in different detectors can be associated with exactly the 
same set of parameters. However, it should be possible to detect signals in coincidence 
by demanding that the measured parameters lie in a sufficiently small range of each other 
[11, 12, 13, 14, 17, 18]. Thus, we can revise the coincidence criteria as follows: triggers from 



different detectors are said to be in coincidence if their parameters all lie within a certain 
acceptable range. Events that pass the coincidence test are subject to further scrutiny but 
we shall focus in this paper on the coincidence test itself. 

From the above discussion it is clear that an important aspect of coincidence analysis 
is the determination of the range of parameter values to be associated with each trigger. 
To this end, until recently, the LIGO Scientific Collaboration (LSC) has deployed a 
phenomenological method for assigning the ranges [11, 12, 13, 14, 17, 18]. More precisely, 
one performs a large number of simulations in which a signal with a known set of parameters 
is added in software to the data which is then passed through the analysis pipeline. The 
pipeline identifies the most probable parameters with each injected signal and the ensemble 
of injected and measured parameters gives the distribution of the errors incurred in the 
measurement process. Given the distribution of the errors, one can choose a range for each 
parameter such that more than, say, 95% of the injected signals are detected in coincidence. 
Choosing wider windows will enable greater detection probability but also increases the 
rate of accidental triggers. On the contrary, smaller windows decrease the false alarm 
rate but also reduce the detection probability. Recently, a Bayesian coincidence test has 
been proposed [19] as an alternative wherein one computes the likelihood of a candidate 
event as belonging to a distribution obeyed by true signals rather the noise background. 
Unfortunately, measuring the distribution function when the parameter space is large could 
be computationally formidable [19] except when the parameters are independent. 

B. A geometric approach to choosing coincident windows 

In this paper we propose a new algorithm based on the metric (equivalently, the 
information matrix) defined on the signal manifold. The idea is very simple, even obvious, 
but leads to a great reduction in the background trigger rate. The advantages of the 
new algorithm are better appreciated by listing certain drawbacks of the phenomenological 
method. The drawbacks are quite naturally remedied in the new approach. 

First, because the current method uses rectangular windows it ignores the correlations 
between different parameters. For instance, in the case of a chirping signal from a black 
hole binary the shape of the signal depends, among others, on the component masses. 
However, not all combinations of the two masses lead to signals that are easily distinguishable 



from one another. Indeed, at the lowest post-Newtonian order the waveform depends only 
on a certain combination of the masses called the chirp mass; binaries of different values 
for the two masses but the same chirp mass produce essentially the same signal. This 
degeneracy is broken when post-Newtonian corrections are included. Nevertheless, the two 
mass parameters continue to be highly correlated. 

The second drawback is that the method employs windows of the same size throughout the 
parameter space while we know that errors in the measurement of the parameters depends, 
in some cases quite sensitively, on the parameters. Drawing again from our example of a 
binary, the error in the estimation of the chirp mass can vary by more than two orders 
of magnitude across the parameter space of interest in the case of systems that LIGO is 
expected to observe (see, e.g., [20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30]). Clearly, it is not 
optimal to deploy windows of the same size all over the parameter space. 

Thirdly, by not taking into account parameter covariances, the method entails 
independent tuning of several parameters at the same time. This could be a horrendous 
problem when dealing with signals characterized by many parameters. For instance, 
continuous radiation from a pulsar is characterized by the location of the pulsar, its spin 
frequency, the derivative of the frequency and so on. These physical parameters are all not 
independent; the existence of covariances among them imply that the effect of variation 
in one parameter can be absorbed by another - thereby complicating the pipeline tuning 
procedure. In the case where parameters have perfect or near perfect covariances, variations 
of the parameters may not even lead to distinct signals at all. This further implies that it may 
not be necessary to tune each parameter separately, rather it should be enough to tune only 
a subset of the parameters or, more precisely, only the principal components. Furthermore, 
the method does not provide a unique set of windows, rather several possibilities could be 
worked out. 

Finally, by using windows of the same size irrespective of the signal-to-noise ratio (SNR) 
of the trigger, the method suffers from an undesirably high false alarm rate, particularly 
in the tail of the SNR distribution. Needless to say, a successful detection of gravitational 
waves necessitates as clean a distribution of the SNRs as possible, with little contamination 
of the tails. One way of reducing the false alarm rate is by using tighter windows at higher 
SNRs. This is well-motivated since true high-SNR events will be associated with smaller 
errors. 



The geometric algorithm proposed in this paper quite naturally overcomes the drawbacks 
of the phenomenological method. The algorithm takes into account the correlations amongst 
the various parameters and deploys parameter- and SNR-dependent ellipsoidal windows 
defined by the Fisher information matrix using a single parameter. The most important 
consequence of the new algorithm is a great reduction in the background rate. 

C. Organization of the paper 

In Sees. II, III and IV, we present and discuss the new algorithm to identify events in 
coincidence. The algorithm comprises of two steps. The first step consists in associating each 
trigger with a p-dimensional ellipsoid. In the second step one tests if the ellipsoid associated 
with a trigger from one detector overlaps, or at least touches, an ellipsoid associated with 
a trigger from another detector. In Sec. V we apply the algorithm developed in Sec. II to 
the case of a transient chirp signal from a binary black hole. This will help us assess the 
extent to which the algorithm is helpful in reducing the background. Sec. VI concludes by 
summarizing the application of the new algorithm in real data analysis pipelines and future 
prospects. 

II. A GEOMETRIC COINCIDENCE ALGORITHM 

This Section begins with a brief introduction to the geometric formulation of signal 
manifold and metric introducing the terminology needed in later Sections. The metric so 
defined helps us in identifying ellipsoidal regions with a given point on the manifold whose 
size is chosen depending on the signal-to-noise ratio (SNR) and the parameter space region 
where the point lies. As an exercise to estimate the efficacy of the new coincidence algorithm 
we then compare the volume of the ellipsoid with that of a proper rectangular box enclosing 
the ellipsoid and aligned along the coordinate lines. 

A. Scalar Product, Signal Manifold and Metric 

The problem of gravitational wave data analysis was addressed in a geometric framework 
with the intention of understanding parameter estimation [27, 28] and computational 



requirements for matched filtering [31, 32, 33]. In this framework, one thinks of the outputs 
of an ensemble of detectors as either finite- or infinite-dimensional vectors depending on 
whether one considers data streams as a discrete sampled set or the continuum limit of 
the same, respectively. For the sake of convenience, in this paper we shall deal with the 
continuum limit. However, all our results are applicable to the more realistic case in which 
detector outputs are treated as finite dimensional vectors. It is easy to see that the set 
of all detector outputs form a vector space satisfying the usual axioms of a vector space. 
The starting point of our discussion is the definition of the scalar product. Given any two 
functions x(t) and y(t), their scalar product (x, y) is defined as [20, 21, 22, 23] 

(x, !/)=2|°°^ [X(f) Y*(f) + X*(f)Y(f)\ , (2.1) 

where X(f) = j_ dtx(t) exp(—27rift) is the Fourier transform of the function x(t) (and 
similarly, Y(f)) and S h (f) is the one-sided noise power-spectral density of the detector. 
The scalar product in Eq. (2.1) is motivated by the likelihood of a known signal buried in 
Gaussian, stationary background [34]. 

Amongst all vectors, of particular interest are those corresponding to gravitational waves 
from a given astronomical source. While every signal can be thought of as a vector in the 
infinite-dimensional vector space of the detector outputs, the set of all such signal vectors 
don't, by themselves, form a vector space. One can immediately see that the norm of a 
signal h (i.e., the square-root of the scalar product of a signal with itself) gives the SNR p 
for a noiseless signal that is filtered using an optimal template [35, 36]: 



P = (h,h) l/2 
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where H(f) is the Fourier transform of the signal h(t). In particular, we can define signals 
h of unit norm: 

h= h =-, (h,h) = l. (2.3) 

The set of all normed signal vectors (i.e., signal vectors of unit norm) form a manifold, 
the parameters of the signal serving as a coordinate system [27, 28, 32, 33]. Thus, each 
class of astronomical source forms an n-dimensional manifold S n , where n is the number of 
independent parameters characterizing the source. For instance, the set of all signals from a 
binary on a quasi-circular orbit inclined to the line-of-sight at an angle l, consisting of non- 
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spinning black holes of masses mi, and m 2 , located a distance D from the Earth 1 initially in 
the direction (9, tp) and expected to merge at a time t c with the phase of the signal at merger 
(fCi forms a nine-dimensional manifold with coordinates {D, 9, ip, m 1; m 2 , t c , ipc, <-, 4>}, 
where ip is the polarization angle of the signal. In the general case of a signal characterized 
by n parameters we shall denote the parameters by p a , where a — 1, ... ,n. 

The manifold <S n can be endowed with a metric g a p that is induced by the scalar product 
defined in Eq. (2.1). The components of the metric in a coordinate system p a are defined 
by 2 

g al 3 = (dji, dfshj , dji = — . (2.4) 

The metric can then be used on the signal manifold as a measure of the proper distance 
d£ between nearby signals with coordinates p a and p a + dp a , that is signals h{p a ) and 
h(p a + dp a ) } 

d£ 2 = gapdp-dp . (2.5) 

Now, by Taylor expanding h{p a + dp a ) around p a , and keeping only terms to second order 
in dp a , it is straightforward to see that the overlap O of two infinitesimally close-by signals 
can be computed using the metric: 



0(dp a ; p a ) = (h(p a ), h(p a + dp° 

= i - y aP dp a d P , (2.6) 

The metric on the signal manifold is nothing but the well-known Fisher information 
matrix usually denoted r Q/3 , (see, e.g., [34, 37]) but scaled down by the square of the SNR, 
i.e., g a/ 3 = p~ 2 T a/ 3. The information matrix is itself the inverse of the covariance matrix Cap 
and is a very useful quantity in signal analysis. The ambiguity function A{dp a ; p a ), familiar 
to signal analysts, is the overlap function defined above: A(dp a ; p a ) ~ 0(dp a ; p a ). Thus, 
the equation 

A(dp a ; p a ) = e, or 0(dp a ; p a ) = e, (2.7) 

1 Even though we deal with normed signals (which amounts to fixing D), astrophysical gravitational wave 
signals are characterised by this additional parameter. 

2 We have followed the definition of the metric as is conventional in parameter estimation theory (see, e.g., 
Refs. [21, 22, 23, 28]) which differs from that used in template placement algorithms (see, e.g. Refs. [32]) 
by a factor of 2. This difference will impact the relationship between the metric and the match as will be 
apparent in what follows. 



where e (0 < e < 1) is a constant, defines the ambiguity surface, or level surface. In 
gravitational wave literature e, which measures the overlap between two mis-matched signals, 
is also called the match. Using the expression for the overlap O [cf. Eq. (2.6)] in Eq. (2.7), 
we can see that the coordinate distance dp a to the ambiguity surface from the coordinate 
point p a is related to the proper distance 3 by: 

g a0 dp a dp/ i = 2(l-e). (2.8) 

Equivalently, d£ = a/2(1 — e). For a given value of the match e the above equation defines 
a (n — l)-dimensional ellipsoid in the n-dimensional signal manifold. Every signal with 
parameters p a + dp a on the ellipsoid has an overlap e with the reference signal at p a . 

B. Coincidence windows 

Having defined the metric (equivalently, the information matrix) and the ambiguity 
function, we next consider the application of the geometric formalism in the estimation 
of statistical errors involved in the measurement of the parameters and then discuss how 
that information may be used in coincidence analysis. We closely follow the notation of Finn 
and Chernoff [21, 22, 23] to introduce the basic ideas and apply their results in the choice 
of coincidence windows. 

Let us suppose a signal of known shape with parameters p a is buried in background 
noise that is Gaussian and stationary. Since the signal shape is known one can use matched 
filtering to dig the signal out of noise. The measured parameters p a will, in general, differ 
from the true parameters of the signal 4 . Geometrically speaking, the noise vector displaces 
the signal vector and the process of matched filtering projects the (noise + signal) vector 
back on to the signal manifold. Thus, any non-zero noise will make it impossible to measure 
the true parameters of the signal. The best one can hope for is a proper statistical estimation 
of the influence of noise. 

The posterior probability density function V of the parameters p a is given by a multi- 



3 Here the proper distance refers to the distance between the signal h(p a ) at the coordinate point p a and 
a signal h(p a + dp a ) with coordinates p a + dp a on the ambiguity surface. 

4 In what follows we shall use an over-line to distinguish the measured parameters from true parameters 
p a . 



variate Gaussian distribution 5 : 
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where n is the number of parameters, Ap a = p a — p a , and C a /3 is the covariance matrix, C 
being its determinant. Noting that C~l = p 2 g a (3, we can re-write the above distribution as: 
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where we have used the fact that C = l/(p 2n g), g being the determinant of the metric g a p. 
Note that if we define new parameters p' a = pp a , then we have exactly the same distribution 
function for all SNRs, except the deviations Ap a are scaled by p. 

Let us first specialize to one-dimension to illustrate what region of the parameter space 
one should associate with a given trigger. In one-dimension the distribution of the deviation 
from the mean of the measured value of the parameter p is given by: 

Ap 2 ' 



dAp 
V(Ap)dAp = / __ exp 



2na 



2a 2 



p^dAp i p > 

exp( -—gppAp 



2n 
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where, analogous to the n-dimensional case, we have used a 2 = l/(p 2 g pp ). Now, at a given 
SNR, what is the volume Vp in the parameter space such that the probability of finding the 
measured parameters p inside this volume is PI This volume is defined by: 



P 



V{Ap)dAp. 



(2.12) 



ApeVp 



Although Vp is not unique it is customary to choose it to be centered around Ap = : 
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cxp 
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pJgp-pdAp ( p 2 g pp Ap 2 
— — exp I — 

p 2 9pp Ap 2 <r 2 (P) V27T V 2 



(2.13) 

where given P the above equation can be used to solve for r(P) and it determines the range 
of integration. For instance, the volumes Vp corresponding to P ~ 0.683, 0.954, 0.997, . . . , 
are the familiar intervals [—a, a], [—2a, 2a], [—3a, 3a], . . . , and the corresponding values of 



r are 1, 2, 3. Since a = l/y P 2 g PP we see that in terms of g pp the above intervals translate to 



1 



1 



9pp y9pp 



1 
P 



9pp y9pp 



9pp \J9pp 



(2.14) 



5 A Bayesian interpretation of T D (/\p a ) is the probability of having the true signal parameters to lie 
somewhere inside the ellipsoidal volume centered at the Maximum Likelihood point p a . In this case 
the overlap-test for determining coincidence test is motivated by a test of concordance that the true signal 
parameters p a should lie in the overlap region. 
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Thus, for a given probability P, the volume Vp shrinks as 1/p. The maximum distance <i max 
within which we can expect to find "triggers" at a given P depends inversely on the SNR p : 
d£ = ^gppAp 2 = r/p. Therefore, for P ~ 0.954, r = 2 and at an SNR of 5 the maximum 
distance is 0.4, which corresponds to a match of e = 1 — \d£ 2 = 0.92. In other words, in 
one-dimension 95% of the time we expect our triggers to come from templates that have 
an overlap greater than or equal to 0.92 with the buried signal when the SNR is 5. This 
interpretation in terms of the match is a good approximation as long as d£ -C 1, which will 
be true for large SNR events. However, for weaker signals and/or greater values of P we 
can't interpret the results in terms of the match although, the foregoing equation Eq. (2.12) 
can be used to determine r(P). As an example, at P ~ 0.997, r = 3 and at an SNR of p = 4 
the maximum distance is d£ = 0.75 and the match is e = 23/32 ~ 0.72, which is significantly 
smaller than 1 and the quadratic approximation is not good enough to compute the match. 
These results generalize to n dimensions. In n-dimensions the volume Vp is defined by 



P= V(Ap a )d n Ap. (2.15) 

J Ap Q eVp 

Again, Vp is not unique but it is customary to center the volume around the point Ap a = : 



p n y /gd n Ap •-- 



P = I — , W2 — exp 



^g a pAp a ApP 



(2.16) 



Given P and the parameter space dimension n, one can iteratively solve the above equation 
for r(P,n). The volume Vp is the surface defined by the equation 

J. (2.17) 

This is the same as the ellipsoid in Eq. (2.8) except that its size is defined by r/p. Let us 
note the generalization of a result discussed previously, namely that the size of the ellipsoid 
is not small enough for all combinations of P and p and, therefore, it is not always possible 
to interpret the distance from the centre of the ellipsoid to its surface in terms of the overlap 
or match of the signals at the two locations except when the distance is close to zero. This 
is because the expression for the match in terms of the metric is based on the quadratic 
approximation which breaks down when the matches are small. However, the region defined 
by Eq. (2.17) always corresponds to the probability P and there is no approximation here 
(except that the detector noise is Gaussian). 
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TABLE I: The value of the (squared) distance d£ 2 = r 2 /p 2 for several values of P and the 
corresponding smallest match that can be expected between templates and the signal at different 
values of the SNR. 



P = 0.683 



dl 2 



«MM 



P = 0.954 



dl 2 



£MM 



P = 0.997 



£MM 



n = 1 



5 


0.04 0.9899 


0.16 0.9592 


0.36 0.9055 


10 


0.01 0.9975 


0.04 0.9899 


0.09 0.9772 


20 


0.0025 0.9994 


0.01 0.9975 


0.0225 0.9944 



n 



5 


0.092 0.9767 


0.2470 0.9362 


0.4800 0.8718 


10 


0.023 0.9942 


0.0618 0.9844 


0.1200 0.9695 


20 


0.00575 0.9986 


0.0154 0.9961 


0.0300 0.9925 



n 



5 


0.1412 0.9641 


0.32 0.9165 


0.568 0.8462 


10 


0.0353 0.9911 


0.08 0.9798 


0.142 0.9638 


20 


0.00883 0.9978 


0.02 0.9950 


0.0355 0.9911 



When the SNR p is large and 1 — P is not close to zero, the triggers are found from the 
signal with matches greater than or equal to 1 — r 2 f 1 ' . Table I lists the value of r for several 
values of P in one-, two- and three-dimensions and the minimum match 6mm for SNRs 5, 
10 and 20. Table I should be interpreted in the light of the fact that triggers come from an 
analysis pipeline in which the templates are laid out with a certain minimal match and one 
cannot, therefore, expect the triggers from different detectors to be matched better than the 
minimal match. 

From the Table, we see that when the SNR is large (say greater than about 10) the 
dependence of the match 6mm on n is very weak; in other words, irrespective of the number 
of dimensions we expect the match between the trigger and the true signal (and for our 
purposes the match between triggers from different instruments) to be pretty close to 1, 
and mostly larger than a minimal match of about 0.95 that is typically used in a search. 
Even when the SNR is in the region of 5, for low P again there is a weak dependence of 
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6mm on the number of parameters. For large P and low SNR, however, the dependence of 
£mm on the number of dimensions becomes important. At an SNR of 5 and P ~ 0.997, 
e M M = 0.91, 0.87, 0.85 for n = 1, 2, 3 dimensions, respectively. 

In general, for a given probability P the size of the ellipsoid at an SNR p is smaller by 
a factor p compared to that at p = 1. Thus, the volume in the parameter space in which 
the measured parameters will lie at a given probability P will scale with the SNR as p~ n . 
Therefore, if the goal of an experiment is to have false dismissal probability that is no greater 
than 1 — P then the ellipsoidal windows given by Eq. (2.17) could be employed when testing 
triggers from different detectors for coincidences. We now have our first result which states 
that: 

When performing coincidence analysis of triggers one should test to see if the 
associated ellipsoids overlap with each other. These ellipsoids describe the 
smallest possible volume within which the false dismissal probability is no more 
than a pre- specified value. 

Notice also that, if one assumes that false alarms are due to accidental coincidences between 
triggers which are otherwise uncorrected 6 , the false alarm rate would then also go down 
by p~ n . Thus, given the false dismissal probability 1 — P the size of the ellipsoid further 
depends on the SNR of the events that are being subject to coincidence analysis, the size 
shrinking sharply as a function of the event's SNR. Thus we have the second of our results: 

The size of the ellipsoids should be chosen in inverse proportion to the signal-to- 
noise ratio. 

However, this latter feature has not yet been implemented in current gravitational wave 
searches and will be a priority for implementation in future versions of the search pipeline 
[38]. The final, and practically speaking probably the most important, result is the following: 

Our coincidence algorithm reduces the number of tunable parameters from n 
(where n is the number of parameters) to 1, irrespective of the dimensionality of 
the signal parameter space. 

6 This is not the case for co-located detectors such as the two LIGO Hanford interferometers. 
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This parameter u introduced in Eq. 3.2 essentially scales the volume of the ambiguity 
ellipsoid - the shape and orientation of which is entirely determined from the metric 
components. The appropriate value of this parameter can be determined by extensive 
Monte-Carlo tests where one injects GW signals in interferometer noise and by optimising 
the detection efficiency vis-a-vis false alarm rate, an acceptable value of « is arrived at. 
Having just one parameter greatly simplifies this tuning procedure. Note that as argued 
before « is SNR-dependent: loud signals with high SNRs are expected to be more consistent 
in their parameters in different detectors. Thus the ellipsoids associated with these high 
SNR triggers are expected to overlap (and hence pass coincidence) even if they each have a 
smaller volume. On the other hand, for weaker signals we need to associate larger ellipsoids 
in order for them to overlap. 

III. OVERLAP OF ELLIPSOIDS 

A key tool in determining coincidences of triggers from two or more detectors is a 
mathematical algorithm to determine if the ellipsoids associated with triggers either touch 
or overlap with each other. This algorithm forms the workhorse for identifying coincidence 
of triggers from two or more detectors. 

As stated in Section II A, triggers resulting from the analysis pipeline are projections of 
the data by normed signal vectors onto an n-dimensional space S n , where n is the number of 
independent parameters characterizing the source. In the foregoing Section we introduced 
ellipsoidal regions in the n-dimensional parameter space with their centers at the location 
of the maximum likelihood point. When we analyze the data, however, we will not know 
before hand if a signal is present in the data and even when there is one we would not know 
where its location in the parameter space is. We will have, nevertheless, the knowledge of the 
location of the triggers in the parameter space. Let us denote the coordinates of a trigger 
from a detector A as q% where a is the index on the parameter space. The coincidence 
analysis proceeds in the following manner. Define an ellipsoidal region £{j>a, g) around 
each trigger q\ by 

£(PA, g) = {pa e S n | (p A - q A ) T g (p A - q A ) ^ 1} , (3.1) 

where q^ e S n is the position vector of the center of the ellipsoid (i.e., the location of the 
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trigger from detector A) and g is the rescaled metric which we shall refer to as the shape 
matrix. It is related to the metric by 

9a(3 = V 2 9*i3 (3-2) 

where /i 2 is a numerical scaling factor used to expand the linear distances of the ellipsoid 
while holding the position of the center and the spatial orientation constant. Eq. (2.17) 
allows us to interpret the parameter \i in terms of the probability P with which the trigger 
can be expected to be found within the ellipsoid £(pa, g) '■ 

P 2 
v 2 = ^7j5y- ( 3 - 3 ) 

Further, the probabilities P associated with a given \i can be found using Eq. (2.16) when 
the background noise is Gaussian. However, most detector noise is non-Gaussian and non- 
stationary and in those cases /z serves as a parameter that must be tuned to achieve a certain 
detection efficiency or, alternatively, a certain false alarm rate. 

Thus, the shape matrix is the scaled metric and encodes the local correlations between 
the parameters in the neighborhood of the trigger center. It is trivial to check that when 
/i = 1, Eq. (3.1) defines the interior of the ambiguity ellipsoid previously defined in Eq. (2.7). 

Once an ellipsoidal model for the trigger is established, following [39] one can construct a 
contact function .Fab(A) of two ellipsoids £(qA, 9 a) an d £(s\b, 9 b) (defined around triggers 
from detectors A and B) as 

F AB {\) = A(l - A) r/ s [Xg- 1 + (1 - A)^ 1 ] -1 ^, (3-4) 

where tab = Qb ~ <\a and A e [0, 1] is a scalar parameter. The maximum of the contact 
function over A in the interval [0, 1] can be shown [39] to be unique. It can also be shown 
that for two overlapping ellipsoids, the maximum of the contact function is less than 1, i.e, 

F = max [.Fab (A)] < 1. (3.5) 

When F = 1, the two ellipsoids 'touch' each other externally. 

In the 'coincidence' data-analysis paradigm, given triggers from iV detectors (N > 2), 
one draws up a list of 'coincident triggers' for further analysis to test their significance. 
The simplest coincident triggers consist of those which have 'consistent' parameters in two 
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FIG. 1: Panel A plots the contact function Eq. (3.4) for two pairs of three-dimensional ellipsoids 
taken from a search for binaries consisting of non-spinning compact objects characterized by 
parameters (t c ,To,Ts) [see Sec. V, in particular Eq. (5.4)]. Panels B, C and D are the projections of 
the ellipsoids in (t c , To), (t c ,T3) and (ro,T3) orthogonal planes, respectively. Solid lines refer to the 
case of non-overlapping ellipsoids and dashed lines are for over-lapping (i.e., coincident) triggers. 
Note that in the latter case the maximum of the contact function is < 1, which is the test that is 
carried out to determine if a pair of triggers are in coincidence. 

detectors (two-way coincidence). Testing for two-way coincidences for triggers from co- 
located detectors (e.g., the two LIGO detectors at Hanford) can be accomplished by a single 
test of Eq. (3.5) on a pair of triggers. 

When the detectors are non-colocated, one needs to allow for a non-zero 'time-of-flight' 
delay between the trigger arrival times. One assumes that the GW signals travel at the speed 
of electro-magnetic radiation in vacuum c and the maximum allowed time delay is then set 
to ±A/c, where A is the distance between the two detectors. As far as the geometrical 
picture of the coincidence test is concerned, for the non-colocated case one needs to test 
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for the overlap of a 'cylindrical' volume (of length 2A/c along the time-dimension) and an 
ellipsoid 7 . In practice, however, the test can be carried out iteratively by adding discrete 
time delays to the trigger (spanning the allowed time-delay) from one detector and testing 
for the overlap condition against the trigger from the other detector, keeping the latter 
fixed in time. The discrete time step can be set to the inverse of the sampling frequency of 
the time series. The fact that the overlap-test is computationally cheap allows for such a 
brute-force implementation strategy to be viable. 

These 2-IFO coincident triggers can now be used as building blocks to construct 
more complex coincidence triggers that have consistent parameters over three or more 
interferometers (3-IFO, ..., n-IFO coincidence triggers). For example, the set of triggers 
{Ti,Tj,T k ) can be classified as 3-IFO coincident if (^,7)), {Tj,T k ) and {T h T k ) 2-IFO 
coincident pairs exist. Here again, the subscripts i ^ j ^ k are labels on interferometers. 
This idea can be generalised to determine the list of n-IFO coincident triggers given the list 
of (n — l)-IFO coincidences. It is useful to note that Eq. 3.5 is the only test we need in 
order to build the entire hierarchy of coincidence triggers. 

We conclude this Section by drawing attention to two practical issues in implementing 
this geometrical coincidence test. The first has to do with the algorithm one uses to draw 
up two-way coincidences. Given the set of triggers from two detectors, one can (a) work 
with time-ordered triggers and (b) find the maximum length of the bounding box of the 
ellipsoid along the time dimension over all the triggers such that for any trigger from one 
detector, the test for overlap is carried out only if a trigger from the other detector occurs at 
a time that is within twice this interval. This approach greatly reduces the overall number 
of overlap tests required to find two-way coincidences. The expression for the length of the 
sides of the bounding box can be algebraically determined given the shape matrix of the 
triggers and is explicitly given for 2- and 3-dimensions in the next Section. 

The second point is on the numerical implementation of the test of the overlap of ellipsoids 
where we maximize the contact function over a single parameter A. Evaluation of the contact 
function involves matrix inversion which can be quite expensive computationally. Under 
these circumstances, prior knowledge of the inverse of trigger shape matrices can prove to 



7 Note that, in the case of an externally triggered search, where the position of the source is known, we can 
use a fixed time delay for non-colocated detectors. 
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be more efficient than on-the-fly computation. Brent's minimization method [40, 41] is 
particularly suitable for fast convergence to the maxima given the well behaved nature of 
the contact function and is available as part of the GNU Scientific Library [42]. 

IV. EXPECTED REDUCTION IN FALSE ALARM RATE 

Next, let us consider the reduction in the false alarm rate as a result of using ellipsoidal 
windows as opposed to rectangular windows 8 . In order to achieve false dismissal probability 
less than or equal to 1 — P, a rectangular window has to be at least as large as the box 
that encloses the ellipsoid. Now the volume of an n-dimensional ellipsoid (n > 2) whose 
semi-axes are a^, k — 1, . . . , n, is given by a recursive formula: 

V n = 2 ^^ f[ o fc , where V = 1, V x = 2. (4.1) 

n - LJ - 

fc=i 

On the other hand, the smallest volume an n-dimensional box that encloses the ellipsoid 
would be 

n n 

U n = l[(2a k ) = 2 n l[a k , (4.2) 

fc=i fc=i 

where a factor of 2 arises since a k are semi-major axes and the side-lengths of the enclosing 

box will be twice that value. Thus, the rectangular box's volume is larger than that of the 

ellipsoid by the factor 

V n vrU n _ 2 
Thus, in 2-, 3- and 4-dimensions the saving is A/n, Q/ir and 32/7T 2 , respectively. However, 

the real factor could be far greater as the error ellipsoids are generally not oriented along 

the coordinate axes. 

When the ellipsoid is not aligned with the coordinate axes, which will be the case when 

there are correlations between the different parameters, the side-lengths of the bounding box 

are given by maximizing each coordinate axis over the entire ellipsoidal surface as follows. 

Starting from Eq. (2.17) one can express the first of the coordinates p = p 1 in terms of the 

other coordinates: 

g n p i + 2g li pp i + g ij p i p> - (-) =0, i,j = 2,...n, (4.4) 

8 This discussion again assumes that false alarms are due to accidental coincidences between otherwise 
uncorrelated triggers. 
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which can be solved to obtain 

1 



P± = — 
0n 



9ii V 1 ± y{9ii 9ij - 0n 9ij)P l P j + (0n r 2 / p 2 ) 



(4.5) 



For our purposes we only need the 'plus' solution. One can then set-up n — 1 equations in 
as many variables by demanding that dp + /dp k = 0, which gives 



{gugik - gu9ik){gijgik - gugjk) , J itrJ r 2 

VV =911-2- ( 4 - 6 ) 



(gugij - 9n9ij) 

9lk 



P 2 ' 



These are again quadratic equations that must be solved (simultaneously) for the coordinates 
pi, j = 2 . . . , n. The resulting (positive) roots, denoted p{ can be substituted in Eq. (4.4) 
to obtain the half-side-length of the ellipse. We shall next give explicit expressions for the 
side-lengths of the enclosing box in two and three dimensions. In higher dimensions the 
expressions are rather cumbersome but the general procedure outlined above can be used to 
compute the volume of the bounding box in all cases. 

The side-lengths of the bounding box are given in two dimensions by 

* = *M, y = 2M, (4.7) 

V \g\ V w 

and in three dimensions by: 

x = 2 



[923 922933J 922 



(912923 ~ 922913? ~ (023 - 02203s) (012 ~ 01102 2 ) ' 



_, = ., , (013-011033)011 



(012013 - 011023) 2 - (013 - 011033) (012 ~ 01102 2 ) ' 

' = 2 \/^^ (ti2- 9ii922)9ii __ (4g) 

(012013 - 011023) - (012 - 011022) (013 ~ 01103s) 

V. APPLICATION TO COALESCING BINARIES 

Inspiralling compact binaries are one of the most promising candidates for detection by 
the laser interferometric detectors. It will, therefore, be interesting to investigate the gains 
of using the new coincidence method in such searches. For the purpose of our discussion, it 
will suffice to use a simple model of the signal. We shall use the Fourier representation of the 
waveform from a binary consisting of non-spinning compact objects on a quasi-circular orbit 
in which post-Newtonian (PN) corrections to the amplitude are neglected, but corrections to 
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the phase are included to the desired order. This waveform is calculated using the stationary 
phase approximation, and is of the form: 

~ h{f) = ^S"Vll r7/6exp [**(/; fc>«k'*) +i i] > ^ 

*(/) = 2nft c + (f>c + Yl A ^ (fc_5)/3 ( 5 - 2 ) 

k 

Where D is the distance to the source, and A is a constant which depends on the relative 
orientations of the detector and the binary orbit, and tc and (fie are as defined in Section II A. 
Waveforms of this type at 2PN order [43, 44] have been used in previous searches for binary 
neutron star inspirals [12], and are currently being used in searches for compact binary 
inspirals with a total mass of < 35M [38]. Moreover, the metric computed for such a 
waveform has been shown to be approximately valid for a range of physical approximants [45, 
46]. At the 2PN order, the coefficients X k are given by the following expressions: 

3 5 /743 11 

~ 128r/(7rM) 5 / 3 ' Al ' 2 ~ 967T77M V336 + ~4 V 
-37T 1 / 3 _ 15 / 3058673 5429 617 2 \ 

3 ~ 8r]M 2 /3' 4 ~ 64r ? ( 7 rM 1 /3 1^1016064 + 1008^ + 144^ ) ' { ' 

where M is the total mass of the system, and r] is the symmetric mass ratio, which is defined 
as i] = m\m2J M 2 . 

The metric required for determining coincidence in the case of non-spinning binaries is 
that in the 3-dimensional space of (to, t , r 3 ), where r and r 3 are the chirp times, which are 
a convenient way of parameterizing the masses of the binary system. They are given by 

^o = ^-(vrM/ L )- 5 / 3 , r 3 = -^(7rMf L )-^, (5.4) 

2567r/ L ?7 8f L r] 

where fi is the frequency below which no appreciable signal can be detected due to rising 
detector noise at low frequencies. 

In obtaining the metric, it proves to be more convenient to use parameters {t c ,0 1 ,9 2 ), 
where d\ = 2iif L T , and 9 2 = 2irf L T 3 . This metric was obtained by Owen in [32]. Here, 
Eq. (2.6) was used, and the phase (fie maximized over to give the expression for the metric: 

9aP = \{J &aM ~ J [Va] J \M) > ( 5 - 5 ) 

where %fi a is the derivative of the Fourier phase of the inspiral waveform with respect to 
parameter 6 a . J is the moment functional of the noise PSD, which is defined for any 
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function a(x) as: 



1 f Xu a(x)x- 7 ? 3 , 

W'mL ^r dx - (5 - 6) 



/(g) is the gth moment of the noise PSD, which is defined by: 

r%u T -<?/3 
I(q) = S h (f ) / -^r-.dx, (5.7) 

Jx L &h\ x ) 

where x = f/fo, /o being a fiducial frequency used to set the range of the numerical values 
of the functions contained in the integrals. The value of xl is chosen so that the contribution 
to the integral for values below xl would be negligible, xu = fu/fo, where fu is the ending 
frequency of the inspiral waveform in question. In deriving the explicit expression for the 
metric, the starting point is the Fourier phase of the waveform in the form [45]: 



*{f',tc,e u e 2 ) = 2nftc + a 01 e 1 x~ 5/3 + a 2l {e l /e 2 ) + a 22 {e l e 2 2 ) ,6 x - 1 + a 31 e 2 x 



-2/3 



+ 



04i {ei/el) + a 42 (e 1 /e 2 ) 1/3 + a 43 (0|/0i) 1/3 x^ 3 } (5. 



where the coefficients a^ m are given by 

_ 3 llvr 743 / 25 \ 1/3 -3 _ 617 2 

aoi -5' a21 -l2"' a22 "2016^J ' a31 "T' ° 41 -384 7r ' 

5429 /25tt\ 1/3 15293365 / 5 \ 1/3 

° 42 " 5376 \~2~) ° 43 " 10838016 \J^j [ ^ ] 

Using the above in Eq. (5.5), one can find an explicit expression for the metric. This 
expression is too unwieldy to write here, but it can be obtained by utilising the fact that, 
since the Fourier phase is a polynomial function, J can be expanded in terms of normalised 
moments J, where 

J(P) = j$j. (5-10) 

To assess the potential gains of using this coincidence method for inspiral analysis, it is 
useful to consider the difference in volume between the ellipsoidal region defined by g, and 
its bounding box aligned with the co-ordinate axes (tc,T ,r 3 ). This ratio can be calculated 
with the help of Eqs. (4.8). Fig. 2 shows how this ratio varies across the (to,t 3 ) space in 
the case of Initial and Advanced LIGO, Virgo and Einstein Telescope (a third generation 
European detector that is currently being designed). It can be seen that for most of the 
parameter space, the volume of the bounding box is an order of magnitude larger than the 
volume of the ellipsoid; however, in certain regions, corresponding to high masses, this ratio 
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FIG. 2: The log 10 of the ratio of the volume of the bounding box to the volume of the ellipsoid as a 
function of location in (tq,t^) space. The plots shown are (clockwise from top-right) for the initial 
LIGO, advanced LIGO, Virgo and Einstein Telescope. The low frequency cutoff is chosen to be 20 
Hz. 



can be as large as two orders of magnitude. This suggests that significant reductions of 
the background can be achieved by using ellipsoidal windows. Runs on example data sets 
suggest that in practice, the reduction in background coincident triggers due to using such 
a coincidence method will be a factor of ~ 10. 

To assess the improvement in the confidence in any candidate detection, it is helpful to 
look at how reducing the background rate by a factor of k will improve the odds O of a 
detection 



0{h\D) 



P{h\D) 



(5.11) 



P(0|£>) 
where P(h\D) is the posterior probability of a signal h being present given the set of triggers 

D has been obtained, and P(0\D) is the probability of there being no signal given D. We 

take the accidental trigger rate to be a Poisson process, with a trigger rate prior to reduction 

A. Assuming that the detection efficiency is not affected by the reduction in the trigger rate, 
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we see that the odds improves by the following factor: 

0(h\D) x/k l-e~ XT 



(5.12) 



0{h\D) x l- e - AT / fc ' 

where T is the duration of the run. 

The factor by which the odds of a signal being present improves by reducing the false 
alarm rate by a factor of k depends on how high the false alarm rate was to start with. If the 
initial false alarm rate is low (AT <C 1), the improvement in the odds approaches the factor 
k. However, for high false alarm rates, the improvement becomes less marked, tending to a 
factor of 1 as AT — > oo. 

VI. SUMMARY AND CONCLUSIONS 

A new method of coincidence analysis is proposed in which, instead of the rectangular 
windows on parameters conventionally used, ellipsoidal windows are employed based on the 
metric defined on the signal manifold. This allows us to use windows of appropriate size 
depending on the location in the parameter space, instead of using a phenomenological 
'best fit' choice of windows across the entire space. The algorithm has a massive practical 
advantage in that it requires the tuning of only one parameter irrespective of the number of 
dimensions of the parameters. This contrasts with the conventional method that required 
us to tune nearly as many parameters as the dimension of the parameter space. In addition, 
the method allows us to take into account covariances between parameters, thus significantly 
reducing the volume enclosed within the windows. In particular, for the case of non-spinning 
compact binary coalescence in Initial LIGO, it is expected that the use of such a method 
will reduce the background rate of coincident triggers by roughly an order of magnitude. By 
also incorporating SNR-dependence into the size of the windows, the background of high 
SNR events can be reduced even further. 

The algorithm has been implemented in C code in the LSC Algorithm Library (LAL) 
[47]. An implementation in (t c ,T ,r 3 ) space, as in Section V, using SNR-independent 
windows, is being employed in the search for compact binary coalescence in S5 data. This 
implementation is referred to as e-thinca [38]. 
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